Search Result

Select

Time-incorporated point-of-interest collaborative recommendation algorithm

BAO Xuan, CHEN Hongmei, XIAO Qing

Journal of Computer Applications 2021, 41 (8): 2406-2411. DOI: 10.11772/j.issn.1001-9081.2020101565

Abstract （446）

PDF （886KB）（335）

Save

Point-Of-Interest (POI) recommendation aims to recommend places that users do not visit but may be interested in, which is one of the important location-based services. In POI recommendation, time is an important factor, but it is not well considered in the existing POI recommendation models. Therefore, the Time-incorporated User-based Collaborative Filtering POI recommendation (TUCF) algorithm was proposed to improve the performance of POI recommendation by considering time factor. Firstly, the users' check-in data of Location-Based Social Network (LBSN) was analyzed to explore the time relationship of users' check-ins. Then, the time relationship was used to smooth the users' check-in data, so as to incorporate time factor and alleviate data sparsity. Finally, according to the user-based collaborative filtering method, different POIs were recommended to the users at different times. Experimental results on real check-in datasets showed that compared with the User-based collaborative filtering (U) algorithm, TUCF algorithm had the precision and recall increased by 63% and 69% respectively, compared with the U with Temporal preference with smoothing Enhancement (UTE) algorithm, TUCF algorithm had the precision and recall increased by 8% and 12% respectively. And TUCF algorithms reduced the Mean Absolute Error (MAE) by 1.4% and 0.5% respectively, compared with U and UTE algorithms.

Reference | Related Articles | Metrics

Select

Hyperspectral band selection based on multi-kernelized fuzzy rough set and grasshopper optimization algorithm

ZHANG Wu, CHEN Hongmei

Journal of Computer Applications 2020, 40 (5): 1425-1430. DOI: 10.11772/j.issn.1001-9081.2019101769

Abstract （425）

PDF （626KB）（316）

Save

Band selection can effectively reduce the spatial redundancy of hyperspectral data and provide effective support for subsequent classification. Multi-kernel fuzzy rough set model is able to analyze numerical data containing uncertainty and approximate description, and grasshopper optimization algorithm can solve optimization problem with strong exploration and development capabilities. Multi-kernelized fuzzy rough set model was introduced into hyperspectral uncertainty analysis modeling, grasshopper optimization algorithm was used to select the subset of bands, then a hyperspectral band selection algorithm based on multi-kernel fuzzy rough set and grasshopper optimization algorithm was proposed. Firstly, the multi-kernel operator was used to measure the similarity in order to improve the adaptability of the model to data distribution. The correlation measure of bands based on the kernel fuzzy rough set was determined, and the correlation between bands was measured by the lower approximate distribution of ground objects at different pixel points in fuzzy rough set. Then, the band dependence, band information entropy and band correlation were considered comprehensively to define the fitness function of band subset. Finally, with J48 and K-Nearest Neighbor ( KNN) adopted as the classifier algorithms, the proposed algorithm was compared with Band Correlation Analysis (BCA) and Normalized Mutual Information (NMI) algorithms in the classification performance on a common hyperspectral dataset Indiana Pines agricultural area. The experimental results show that the proposed algorithm has the overall average classification accuracy increased by 2.46 and 1.54 percentage points respectively when fewer bands are selected.

Reference | Related Articles | Metrics

Select

Hyperspectral band selection algorithm based on kernelized fuzzy rough set

ZHANG Wu, CHEN Hongmei

Journal of Computer Applications 2020, 40 (1): 258-263. DOI: 10.11772/j.issn.1001-9081.2019071211

Abstract （338）

PDF （959KB）（228）

Save

In order to reduce the redundancy between hyperspectral band images, decrease the computing time and facilitate the following classification task, a hyperspectral band selection algorithm based on kernelized fuzzy rough set was proposed. Due to strong similarity between adjacent bands of hyperspectral images, the kernelized fuzzy rough set theory was introduced to measure the importance of bands more effectively. Considering the distribution characteristics of categories in the bands, the correlation between bands was defined according to the distribution of the lower approximate set of bands, and then the importance of bands was defined by combining the information entropy of bands. The search strategy of maximum correlation and maximum importance was used to realize the band selection of hyperspectral images. Finally, experiments were conducted on the commonly used hyperspectral dataset Indiana Pines agricultural area by using the J48 and KNN classifiers. Compared with other hyperspectral band selection algorithms, this algorithm has overall average classification accuracy increased by 4.5 and 6.6 percentage points respectively with two classifiers. The experimental results show that the proposed algorithm has some advantages in hyperspectral band selection.

Reference | Related Articles | Metrics

Select

NIBoost: new imbalanced dataset classification method based on cost sensitive ensemble learning

WANG Li, CHEN Hongmei, WANG Shengwu

Journal of Computer Applications 2019, 39 (3): 629-633. DOI: 10.11772/j.issn.1001-9081.2018071598

Abstract （494）

PDF （858KB）（359）

Save

The problem of misclassification of minority class samples appears frequently when classifying massive amount of imbalanced data in real life with traditional classification algorithms, because most of these algorithms only suit balanced class distribution or samples with same misclassification cost. To overcome this problem, a classification algorithm for imbalanced dataset based on cost sensitive ensemble learning and oversampling-New Imbalanced Boost (NIBoost) was proposed. Firstly, the oversampling algorithm was used to add a certain number of minority samples to balance the dataset in each iteration, and the classifier was trained on the new dataset. Secondly, the classifier was used to classify the dataset to obtain the predicted class label of each sample and the classification error rate of the classifier. Finally, the weight coefficient of the classifier and new weight of each sample were calculated according to the classification error rate and the predicted class labeles. Experimental results on UCI datasets with decision tree and Naive Bayesian used as weak classifier algorithm show that when decision tree was used as the base classifier of NIBoost, compared with RareBoost algorithm, the F-value is increased up to 5.91 percentage points, the G-mean is increased up to 7.44 percentage points, and the AUC is increased up to 4.38 percentage points. The experimental results show that the proposed algorithm has advantages on imbalanced data classification problem.

Reference | Related Articles | Metrics

Select

Domain-driven high utility co-location pattern mining method

JIANG Wanguo, WANG Lizhen, FANG Yuan, CHEN Hongmei

Journal of Computer Applications 2017, 37 (2): 322-328. DOI: 10.11772/j.issn.1001-9081.2017.02.0322

Abstract （561）

PDF （1053KB）（611）

Save

A spatial co-location pattern represents a subset of spatial features whose instances are frequently located together in spatial neighborhoods. The existing interesting metrics for spatial co-location pattern mining do not take account of the difference between features and the diversity between instances belonging to the same feature. In addition, using the traditional data-driven spatial co-location pattern mining method, the mining results often contain a lot of useless or uninteresting patterns. In view of the above problems, firstly, a more general study object-spatial instance with utility value was proposed, and the Utility Participation Index (UPI) was defined as the new interesting metric of the spatial high utility co-location patterns. Secondly, the domain knowledge was formalized into three kinds of semantic rules and applied to the mining process, and a new domain-driven iterative mining framework was put forward. Finally, by the extensive experiments, the differences between mined results with different interesting metrics were compared in two aspects of utility ratio and frequency, as well as the changes of the mining results after taking the domain knowledge into account. Experimental results show that the proposed UPI metric is a more reasonable measure in consideration of both frequency and utility, and the domain-driven mining method can effectively find the co-location patterns that users are really interested in.

Reference | Related Articles | Metrics

Select

Design and implementation of UDP-based terminal adaptive protocol

WANG Bin CHEN Hongmei ZHANG Baoping

Journal of Computer Applications 2013, 33 (04): 943-946. DOI: 10.3724/SP.J.1087.2013.00943

Abstract （671）

PDF （651KB）（447）

Save

Aiming at terminal performance bottleneck among current data transfer process, a UDP-based terminal adaptive protocol was proposed. After the analysis and the comparison of many factors which affected terminal performance, this protocol viewed both the previous packet loss ratio and the current one as congestion detection parameters. It employed various rate adaption methods such as finite loop counter and process scheduling function in order to balance performance differences in real-time and ensured reliable and fast data transfer. Compared with traditional idle Automatic Repeat reQuest (ARQ) method, the average delay is reduced by more than 25%. The experimental results show that the proposed algorithm has the features of strong real-time, quick response, and it is compatible with large amount of data transmission, especially suitable for small amount of data transmission in engineering applications.

Reference | Related Articles | Metrics